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Kaufman, Claire 



From: 

Sent: 

To: 

Subject: 



Kaufman, Claire 

Wednesday, August 13, 2003 10:50 AM 
ST I C- B iot ech/C h e m Li b 
sequence search 09/990,940 



SEQUENCE SEARCH REQUEST 



NAME: CLAIRE KAUFMAN AU: 1646 MAILBOX: 10D19 
SERIAL NUMBER: 09/990,940 DATE: 8/1 3/03 



Please search SEQ ID NO: 17 and 18 and 
oligo search both 17 and 18 (I need 50 contiguous amino acids or 100 contiguous 
nucleotides). Please search in commercial and interference database. 

Please put results on disk. 

Thanks! 



Claire Kaufman 
AU 1646, 305-5791 
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AC Q8BHH0; PRELIMINARY; PRT; 365 AA 

DT 01-MAR-2003 (TrEMBLrel. 23, Created) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Hypothetical rhodopsin-like GPCR superf amily containing protein. 

OS Mus mus cuius (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6J; TISSUE=Dorsal root ganglion, and Head; 

RX MEDLINE=22354683; PubMed=12 4 66851 ; 

RA The FANTOM Consortium, 

RA the RIKEN Genome Exploration Research Group Phase I & II Team; 

RT "Analysis of the mouse transcriptome based on functional annotation of 

RT 60, 77 0 full-length cDNAs . " ; 

RL Nature 420:563-573(2002). 

DR EMBL; AK048439; BAC33337.1; -. 

DR EMBL; AK051723; BAC34735.1; -. 

KW Hypothetical protein. 

SQ SEQUENCE 365 AA; 41759 MW; 1EB7E5369632ED56 CRC64; 



Query Match 88.2%; Score 1905; DB 11; Length 365; 

Best Local Similarity 100.0%; Pred. No. 8.6e-160; 

Matches 365; Conservative 0; Mismatches 0; Indels 0; Gaps 0 



Qy 


52 


ML I FALAL FGNALVVYVVT RS KAMRT VTN I FI C S LAL S DLL I VF FC I P VTMLQNVS DT WL 


111 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


1 


ML I FALAL FGNALVVYVVT RS KAMRT VTN I FI CS LALS DLL I VFFC I P VTMLQNVS DTWL 


60 


Qy 


112 


GGAFICKMVPFVQCTAIVTEILTMTCIAVERHQGLVHPFKMKRQYTNQRAFTMLGVVWLV 


171 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


61 


GGAFICKMVPFVQCTAIVTEILTMTCIAVERHQGLVHPFKMKRQYTNQRAFTMLGWWLV 


120 


Qy 


172 


AIIIGSPMWHVQRLEIKYDFLYEKEHICCLEEWSSPVHQKIYTTFILVTLFLLPLLLLSV 


231 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 II 1 1 1 




Db 


121 


AIIIGSPMWHVQRLEIKYDFLYEKEHICCLEEWSSPVHQKIYTTFILVTLFLLPLLLLSV 


180 


Qy 


232 


LYGKIGYELWIKKRIGDGSVLRTIHGKEMFKIARKKKRAVIMMVTVWLFAVCWAPFHIV 


291 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


181 


LYGKIGYELWIKKRIGDGSVLRTIHGKEMFKIARKKKRAVIMMVTVWLFAVCWAPFHIV 


240 


Qy 


292 


HMMIEYSNFEKEYDEVTI KMI FAIVQI I GFFNS ICNPI I YALMNENFKKNFVSAVCYCI V 


351 






1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


241 


HMMIEYSNFEKEYDEVTI KMI FAIVQI I GFFNS ICNPI I YALMNENFKKNFVSAVCYCIV 


300 


Qy 


352 


KETPSSARKHGSSGAMVMHRRAKLAARENPVEIKGEAFGGSNIDIKWCEQPEKKKRRSKV 


411 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 




Db 


301 


KETPSSARKHGSSGAMVMHRRAKLAARENPVEIKGEAFGGSNIDIKWCEQPEKKKRRSKV 


360 


Qy 


412 


ASCPL 416 




Db 


361 


1 1 1 1 1 

ASCPL 365 
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22-AUG-2000 (first entry) 

Human G protein coupled receptor hRUP4 (V272K) protein SEQ ID NO: 128. 

Human; G protein coupled receptor; GPCR; transmembrane receptor; 
identification; agonist; screening; therapeutic; pharmaceutical; 
mutant . 



Homo sapiens . 
Synthetic . 



WO200022131-A2. 



AC AAB02853; 
XX 
DT 
XX 
DE 
XX 
KW 
KW 
KW 
XX 
OS 
OS 
XX 
PN 
XX 
PD 
XX 
PF 
XX 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
XX 
PA 
XX 
PI 
PI 
XX 
DR 
DR 
XX 
PT 
PT 



20 


-APR- 


2000 






13 


-OCT- 


1999; 99WO- 


US24065. 


13 


-OCT- 


1998 


; 98US- 


0170496. 


12 


-NOV- 


1998 


? 98US- 


0108029. 


20 


-NOV- 


1998 


; 98US- 


0109213. 


27 


-NOV- 


1998 


; 98US- 


0110060. 


16 


-FEB- 


1999 


; 99US- 


0120416. 


26 


-FEB- 


1999 


; 99US- 


0121852. 


12 


-MAR- 


1999 


; 99US- 


0123944. 


12 


-MAR- 


1999 


; 99US- 


0123945. 


12 


-MAR- 


1999 


? 99US- 


0123946. 


12 


-MAR- 


1999 


? 99US- 


0123948. 


12 


-MAR- 


1999, 


f 99US- 


0123949. 


12 


-MAR- 


1999, 


f 99US- 


0123951. 


28 


-MAY- 


1999, 


f 99US- 


0136436. 


28 


-MAY- 


1999, 


f 99US- 


0136437. 


28 


-MAY- 


1999, 


99US- 


0136439. 


28 


-MAY- 


1999, 


99US- 


0137127 . 


28 


-MAY- 


1999, 


99US- 


0137131. 


28 


-MAY- 


1999, 


99US- 


0137567 . 


30 


-JUN- 


1999, 


99US- 


0141448. 


27 


-AUG- 


1999, 


99US- 


0151114. 


03 


-SEP- 


1999, 


99US- 


0152524 . 


29 


-SEP- 


1999, 


99US- 


0156633. 


29 


-SEP- 


1999, 


99US- 


0156555. 


29 


-SEP- 


1999, 


99US- 


0156634. 



(AREN-) ARENA PHARM INC. 

Behan DP, Lehmann-Bruinsma K, Chalmers DT, Chen R, Dang HT; 
Gore M, Liaw CW, Lin I, Lowitz K, White C; 

WPI; 2000-317986/27. 
N-PSDB; AAA46115. 



Non-endogenous, human G protein-coupled receptors for screening 
receptor, inverse or partial agonists useful as therapeutic agents 
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xx 

PS Example 2; Page 164-166; 187pp; English. 
XX 

CC The present invention describes transmembrane receptors, preferably 

CC human G protein coupled receptors (GPCR) , for which the endogenous 

CC ligand is unknown (orphan GPCR receptors) . More specifically the present 

CC invention relates to non-endogenous, constitutively activated versions 

CC of a human GPCR. These non-endogenous human GPCRs can be useful for 

CC the direct identification of candidate compounds as receptors agonists, 

CC inverse agonists or partial agonists for use as pharmaceutical agents. 

CC AAA46017 to AAA46126 and AAB02825 to AAB02859 represent sequences used in 

CC the exemplification of the present invention. 

XX 

SQ Sequence 431 AA; 



Query Match 81.4%; Score 1758; DB 21; Length 431; 

Best Local Similarity 79.7%; Pred. No. 1.2e-187; 

Matches 337; Conservative 37; Mismatches 39; Indels 10; Gaps 

3; 



Qy 4 NLTAEQLSALLRLHNLTRAQFIAHYGLRPLVLTPQLPARARLALLLVGMLIFALALFGNA 63 

I : I II I III I I I I I I I I I I I I I I I I I : I I I I : I I I : I I : I I I I I I I I I II 

Db 5 NITPEQFSRLLRDHNLTREQFIALYRLRPLVYTPELPGRAKLALVLTGVLIFALALFGNA 64 

Qy 64 LWYVWRSKAMRTVTNIFICSIALSDLLIVFFCIP 123 

II I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I II I I : I I I I I I I I I I I I I I I I 

Db 65 LVFYWTRSKAMRTWNIFICSIA^ 124 



Qy 124 QCTAIVTEILTMTCIAVERHQGLVTIPFKMKRQYTNQRAFTMLGVWLVAIIIGSPMWHVQ 183 

I I I : I I I : I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I : I : I I I I I I I I 
Db 125 Q S TAWT EMLTMT C I AVE RHQ GLVH P FKMKWQ YTNRRAFTMLGVVWLVAVI VGS PMWHVQ 184 

Qy 184 RLEIKYDFLYEKEHICCLEEWSSPVHQKIYTTFILVTLFLLPLLLLSVLYGKIGYELWIK 243 

: I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I MINI::: : I I I I I I I I I I I 
Db 185 QLEIKYDFLYEKEHICCLEEWTSPVHQKIYTTFILVILFLLPLMVMLILYSKIGYELWIK 244 



Qy 244 KRIGDGSVLRTIHGKEMFKIARKKKRAVIMMVTVWLFAVCWAPFHIVHMMIEYSNFEKE 303 

I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I II I I I I I I I I 
Db 245 KRVGDGSVLRT I HGKEMS KI ARKKKRAKIMMVT WAL FAVCWAP FHWHMMI E YSN FEKE 304 



Qy 304 YDEVTIKMIFAIVQIIGFFNSICNPIIYALMNENFKKNFVSAVCYCIVKETPSSARKHGS 363 

I I : I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I : I I I I I I I I : I I I : : I I : 
Db 305 YDDVTIKMIFAIVQIIGFSNSICNPIVYAFMNENFKKNVLSAVCYCIVNKTFSPAQRHGN 364 



Qy 364 SGAMVMHRRAKLAARENPV-EIKGEAFGGSNIDIKWCEQPEKKKR RSKVA-S 413 

II : I :: I I : I I I I I I I I I I I I I : : I I I I I : I I : I I : : I : 

Db 365 SGITMMRKKAKFSLRENPVEETKGEAFSDGNIEVKLCEQTEEKKKLKRHLALFRSELAEN 424 



Qy 



Db 



414 CPL 416 
I I 

425 SPL 427 



40 



50 



55 



neuropeptide Y/peptide YY receptor Y2 - human 

N;Alternate names: neuropeptide y/peptide YY receptor type 2 
, C; Species: Homo sapiens (man) 

C;Date: Ol-Mar-1996 #sequence_revision Ol-Mar-1996 #text_change 20-Apr-2000 
5 C;Accession: 139187; 139163; G02301 

R;Gerald, C; Walker, M.W. ; Vaysse, P.J. 
J. Biol. Chem. 270, 26758-26761, 1995 

A;Title: Expression cloning and pharmacological characterization of a human 

hippocampal neuropeptide Y/peptide YY Y2 receptor subtype. 
10 A;Reference number: 139187; MUID : 96070760 ; PMID:7592910 

A;Accession: 139187 

A; Status : preliminary 

A; Molecule type: mRNA 

A; Residues: 1-381 <GER> 
15 A; Cross-references: EMBL:U36269; NID : gl063633; PIDN : AAC50281 . 1 ; PID:gl063634 

R ; Rose , P.M.; Fernandes , P . ; Lynch , J . S . ; Frazier , S . T . ; Fisher , S . M . ; 

Kodukula, K. ; Kienzle, B. ; Seethala, R. 

J. Biol. Chem. 270, 22661-22664, 1995 

A; Title: Cloning and functional expression of a cDNA encoding a human type 2 
20 neuropeptide Y receptor. 
A; Gene: GDB : NPY2R 

A;Cross-references: GDB:4365607; OMIM:162642 

A;Map position: 4q31-4q31 

C; Superf amily : neurokinin 1 receptor 
25 F;49-76/Domain: transmembrane #status predicted <TM1> 

F; 8 7- 113/ Domain : transmembrane #status predicted <TM2> 

F; 166-18 6/ Domain : transmembrane #status predicted <TM4> 

F;221-237/Domain: transmembrane #status predicted <TM5> 

F;269-291/Domain: transmembrane #status predicted <TM6> 
30 F;305-328/Domain: transmembrane #status predicted <TM7> 

F; 12 3-2 03/ Disulfide bonds: #status predicted 

F; 342/Binding site: palmitate (Cys) (covalent) #status predicted 
F; 372/Binding site: carbohydrate (Asn) (covalent) #status predicted 

35 Query Match 20.5%; Score 442; DB 2; Length 381; 

Best Local Similarity 30.7%; Pred. No. 3.7e-30; 

Matches 96; Conservative 77; Mismatches 106; Indels 34; Gaps 



45 : | | : | : | | : | | : : I : I I I : : I I 



9; 




Qy 


44 


Db 


50 


Qy 


104 


Db 


110 


Qy 


164 


Db 


168 


Qy 


218 


Db 


224 


Qy 


276 



I : I I I : I I : I I : : II : I : I I I I I I I I : I I : : I I I : hi I 



: I : I : : : : I I : : I I II I : I I : I 



LVTLFLLPLLLLSVLYGKIGYELW— IKKRIGDGSVLRTIHGKEMFKIARKKKRAVIMMV 275 
I : I : : I I I : : I I : I I : I : I : I : : : : : hi 

LLILYVLPLGIISFSYTRI WSKLKNHVSPGAANDHYH QRRQKTTKMLV 271 

TVWL FAVCWAP FH I VHMMI E YSN FE KEYDEVTIKMIFAIVQIIGFFNSICNPIIYA 332 

11:111 I I I : : : : III I : I I : II : : I h : I 



272 CWWFAVSWLPLHAFQLAVDIDSQVLDLKEY KLIFTVFHIIAMCSTFANPLLYG 326 

333 LMNEN FKKN FVS A 345 8/9=88% identical 

II I : : I I : I I 
327 WMNSNYRKAFLSA 339 



Application/Control Number: 09/990 5 940 Page $ 

ArtUnit:1646 ^t()\)^NCC Q/VwP^/SdVV/ 

AC Q8BHH0; PRELIMINARY; PRT; 365 AA 

DT 01-MAR-2003 (TrEMBLrel. 23, Created) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

5 DE Hypothetical rhodopsin-like GPCR superfamily containing protein. 

OS Mus mus cuius (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata ; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

10 RN [1] 

RP SEQUENCE FROM N . A. 

RC STRAIN=C57BL/6J; TISSUE=Dorsal root ganglion, and Head; 

RX MEDLINE=22354 683; PubMed=12466851 ; 

RA The FANTOM Consortium, 

15 RA the RIKEN Genome Exploration Research Group Phase I & II Team; 

RT "Analysis of the mouse transcriptome based on functional annotation of 

RT 60,770 full-length cDNAs . " ; 

RL Nature 420:563- 573 (200 2) 

DR EMBL; AK048439; 

20 DR EMBL; AK051723; 



BAC33337 
BAC34735 



T71- 



KW Hypothetical protein. 

SQ SEQUENCE 365 AA; 41759 MW; 1EB7E5369632ED56 CRC64; 

Query Match 88.2%; Score 1905; DB 11; Length 365; 

25 Best Local Similarity 100.0%; Pred. No. 8.6e-160; 

Matches 365; Conservative 0; Mismatches 0; Indels 0; Gaps 0 

ML I FALAL FGNAL VVYVVT RS KAMRT VTN I FI C S LAL S DL L I VF FC I P VTMLQNVS DTWL 
I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
30 Db 1 ML I FALAL FGNAL VVYVVT RS KAMRT VTN I FI C S LAL S DL L I VF FC I P VTMLQNVS DTWL 60 

GGAFI CKMVP FVQCTAI VT E I LTMTC I AVERHQGLVH P FKMKRQ YTNQRAFTMLGWWLV 17 1 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I 
GGAFI CKMVP FVQCTAI VTEILTMTCIAVERHQGLVHP FKMKRQ YTNQRAFTMLGWWLV 120 

35 

AIIIGSPMWHVQRLEIKYDFLYEKEHICCLEEWSSPVHQKIYTTFILVTLFLLPLLLLSV 231 

I I I I I I I I I I I I I I I I I II I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
AIIIGSPMWHVQRLEIKYDFLYEKEHICCLEEWSSPVHQKIYTTFILVTLFLLPLLLLSV 180 

40 Qy 232 LYGKIGYELWIKKRIGDGSVLRTIHGKEMFKIARKKKRAVIMMVTVWLFAVCWAPFHIV 291 

I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
LYGKIGYELWIKKRIGDGSVLRTIHGKEMFKIARKKKRAVIMMVTVWLFAVCWAPFHIV 240 

HMMIEYSNFEKEYDEVTIKMIFAIVQIIGFFNSICNPII YALMNENFKKNFVSAVCYCIV 351 
45 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

HMMIEYSNFEKEYDEVTIKMIFAIVQIIGFFNSICNPI I YALMNENFKKNFVSAVCYCIV 300 

KETPS SARKHGS S GAMVMHRRAKLAARENPVEI KGEAFGGSNI DI KWCEQPEKKKRRS KV 411 

I I I I I I I I I I I I I I I II I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
50 Db 301 KETPS SARKHGS S GAMVMHRRAKLAARENPVEI KGEAFGGSNI DI KWCEQPEKKKRRS KV 360 

ASCPL 
I I I I I 



Qy 


52 


Db 


1 


Qy 


112 


Db 


61 


Qy 


172 


Db 


121 


Qy 


232 


Db 


181 


Qy 


292 


Db 


241 


Qy 


352 


Db 


301 


Qy 


412 


Db 


361 
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AC 
XX 
DT 
XX 
DE 
XX 
KW 
KW 
XX 
OS 
XX 
PN 
XX 
PD 
XX 
PF 
XX 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
XX 
PA 
XX 
PI 
XX 
DR 



AAY71309; 

02-NOV-2000 (first entry) 

Human orphan G protein-coupled receptor hRUP4 . 

Human; orphan G protein-coupled receptor; GPCR; hRUP4; drug screening; 
transmembrane receptor; expressed sequence tag; EST; signal cascade. 

Homo sapiens . 

WO200031258-A2, 



02-JUN-2000, 



13-OCT-1999; 



99WO-US23687. 



20-NOV- 
16-FEB- 
26-FEB- 
12-MAR- 
12-MAR- 
2 8-MAY- 
2 8-MAY- 
2 8-MAY- 
2 8 -MAY - 

28- MAY- 
2 8-MAY- 
2 9-JUN- 

29- SEP- 
29-SEP- 
29-SEP- 
29-SEP- 
01-OCT- 
01-OCT- 
01-OCT- 
01-OCT- 
01-OCT- 
12-OCT- 
12-OCT- 



1998 
1999 
1999 
1999 
1999 
1999 
1999 
1999 
1999 
1999 
1999 
1999 
1999 
1999 
1999 
1999 
1999 
1999 
1999 
1999 
1999 
1999 
1999 



98US- 
99US- 
99US- 
99US- 
99US- 
99US- 
99US- 
99US- 
99US- 
99US- 
99US- 
99US- 
99US- 
99US- 
99US- 
99US- 
99US- 
99US- 
99US- 
99US- 
99US- 
99US- 
99US- 



0109213. 
0120416. 
0121852. 
0123946. 
0123949. 
0136436. 
0136437. 
0136439. 
0136567. 
0137127. 
0137131. 
0141448. 
0156555. 
0156633. 
0156634. 
0156653. 
0157280. 
0157281. 
0157282. 
0157293. 
0157294. 
0416760. 
0417044. 



(AREN-) ARENA PHARM INC. 

Chen R, Dang HT, Liaw CW, Lin I; 

WPI; 2000-400068/34. 
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Pagel? 



DR N-PSDB; AAD01136. 
XX 

PT Novel human orphan G protein-coupled receptors and the encoding cDNAs 

PT for use in the identification of G protein-coupled receptor agonists - 
XX 

PS Claim 74; Page 89-91; 102pp; English. 
XX 

CC The present amino acid sequence is the hRUP4, an endogenous human 

CC orphan G protein-coupled receptor (GPCR) . The full length hRUP4 cDNA was 

CC cloned by RT-PCR with human brain cDNA as template. The hRUP4 PCR 

CC fragment obtained was an alternatively spliced form of the EST (expressed 

CC sequence tag) clone AI307658. The orphan GPCR of the invention, like 

CC all GPCRs has seven transmembrane alpha helices with an extracellular 

CC N-terminus and an intracellular C-terminus . However, no endogenous 

CC ligands has yet been identified for the proteins of the invention. The 

CC orphan GPCRs may be used in the identification of their endogenous 

CC ligands, and to screen potential GPCR agonists and antagonists for use as 

CC pharmaceutical agents. The proteins may also be used in the study of 

CC GPCR-mediated signalling cascades, and to elucidate their precise role in 

CC normal and diseased human conditions. Nucleic acid encoding human orphan 

CC GPCRs may be used for tissue localisation expression analysis to provide 

CC information about their function in healthy and pathological states. 

XX 

SQ Sequence 431 AA; 



Query Match 81.7%; Score 1764; DB 21; Length 431; 

Best Local Similarity 79.9%; Pred. No. 2.5e-188; 

Matches 338; Conservative 37; Mismatches 38; Indels 10; Gaps 



Qy 4 NLTAEQLSALLRLHNLTRAQFIAHYGLRPLVLTPQLPARARLALLLVGMLIFALALFGNA 63 

I : I II I III I I I I I I I I I I I I I I I 11:11 I I : I I I : I I : I I I I I I I I I I I 

Db 5 NITPEQFSRLLRDHNLTREQFIALYRLRPLVYTPELPGRAKLALVLTGVLIFALALFGNA 64 

Qy 64 L WYWTRS KAMRTVTNI FI CSLALS DLLI VFFCI P VTMLQNVS DTWLGGAFI CKMVP FV 123 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I 

Db 65 LVFYWTRS KAMRTVTNI FI CSLALS DLLIT FFCI PVTMLQNI S DNWLGGAFI CKMVP FV 124 

Qy 124 QCTAIVTEI LTMTCIAVERHQGLVHPFKMKRQYTNQRAFTMLGVVWLVAI I IGS PMWHVQ 183 

I I I : I I I : I I I I I I I I I I I I I I I II I I I I I I I I : I I I I I I I I I II I I : I : I I I I I I I I 

Db 12 5 Q S TAWT EMLTMT C I AVERHQ GLVH P FKMKWQ YTN RRAFTMLGWWLVAVI VG S PMWHVQ 184 

Qy 184 RLEIKYDFLYEKEHICCLEEWSSPVHQKIYTTFILVTLFLLPLLLLSVLYGKIGYELWIK 243 

: I I I I I I I I I I I I I I I I I I I I : I I I I I I I II I I I I I I I I I I I : : : : I I I I I I I I I I I 

Db 185 QLEIKYDFLYEKEHICCLEEWTSPVHQKIYTTFILVILFLLPLMVMLILYSKIGYELWIK 244 

Qy 244 KRIGDGSVLRTIHGKEMFKIARKKKRAVIMMVTVWLFAVCWAPFHIVHMMIEYSNFEKE 303 

I I : I II I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I II I I I I : I I II I I I I I I I I I 

Db 245 KRVGDGSVLRTIHGKEMSKIARKKKRAVIMMVTWALFAVCWAP FHWHMMIEYSNFEKE 304 



Qy 304 YDEVTIKMIFAIVQIIGFFNSICNPIIYALMNENFKKNFVSAVCYCIVKETPSSARKHGS 363 

I I : I I I I I I I M I I I I I I I I I I I I I : I I I I I I I I I I : II I I I I I I : I I I : : I I : 
Db 305 YDDVTIKMIFAIVQIIGFSNSICNPIVYAFMNENFKKNVLSAVCYCIVNKTFSPAQRHGN 364 
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364 SGAMVMHRRAKLAARENPV-EIKGEAFGGSNIDIKWCEQPEKKKR RSKVA-S 413 

II : I :: I I : I I I I I I I I I I I I I : : I I I I I : I I : I I : : I : 

365 SGITMMRKKAKFSLRENPVEETKGEAFSDGNIEVKLCEQTEEKKKLKRHLALFRSELAEN 424 

414 CPL 416 
I I 

425 SPL 427 



Qy 

Db 

5 Qy 
Db 



10 



Application/Control Number: 09/990,940 
Art Umt: !646 ^ ^ ^ Pflg/$1N - £> 

J. Biol. Chem. 270, 22661-22664, 1995 

A; Title: Cloning and functional expression of a cDNA encoding a human type 2 
neuropeptide Y receptor. 

A;Reference number: 139163; MUID : 96032 678 ; PMID:7559383 
A;Accession: 139163 

A; Status: preliminary; translated from GB/EMBL/DDBJ 

A;Molecule type: mRNA 

A;Residues: 1-133, 1 A 1 , 135-381 <ROS> 

A;Cross-references: EMBL : U32500 ; NID : gl000750 ; PIDN : AAA93170 . 1 ; PID:gl000751 
R;Yan, H.; Yang, J.; Marasco, J.; Yamaguchi, K. ; Brenner, S.; Collins, F. ; 
Karbon, W. 

submitted to the EMBL Data Library, December 1995 
A;Reference number: H01019 
A; Accession: G02301 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A;Molecule type: mRNA 

A; Residues: 1-171, , G , ,173, 'R', 175-201, ' P', 203-208, ' A 1 , 210-381 <YAN> 
A;Cross-references: EMBL:U42389; NID : gl314329 ; PIDN : AAB07760 . 1 ; PID:gl314330 
C; Genetics : 
A; Gene: GDB : NPY2R 

A; Cross-references: GDB:4365607; OMIM:162642 

A;Map position: 4q31-4q31 

C; Superf amily : neurokinin 1 receptor 

C; Keywords: appetite; G protein-coupled receptor; glycoprotein; lipoprotein; 

thiolester bond; transmembrane protein 

F; 4 9-7 6/ Domain : transmembrane #status predicted <TM1> 

F; 8 7 -113 /Domain : transmembrane #status predicted <TM2> 

F; 166-18 6'/ Domain : transmembrane #status predicted <TM4> 

F;221-237/Domain: transmembrane #status predicted <TM5> 

F; 2 6 9-2 91 /Domain : transmembrane #status predicted <TM6> 

F; 305-32 8 /Domain : transmembrane #status predicted <TM7> 

F; 123-203/Disulf ide bonds: #status predicted 

F; 342/Binding site: palmitate (Cys) (covalent) #status predicted 

F; 372/Binding site: carbohydrate (Asn) (covalent) #status predicted 

Query Match 20.5%; Score 442; DB 2; Length 381; 

Best Local Similarity 30.7%; Pred. No. 3.7e-30; 

Matches 96; Conservative 77; Mismatches 106; Indels 34; Gaps 



I : I I I : I I : I I : : I I : I : I I I I I I I I : I I : : II I : hi h 



9; 




Qy 


44 


Db 


50 


Qy 


104 


Db 


110 


QY 


164 


Db 


168 


Qy 


218 



I I : I : I I : I I : : I : I I I : : I 



I : I : : : : I I : : I I II I : I I : 



LVTLFLLPLLLLSVLYGKIGYELW--IKKRIGDGSVLRTIHGKEMFKIARKKKRAVIMMV 275 
I : I : : I I I : : I I : I I : I : I : I : : : : : I : I 
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Db 224 LLILYVLPLGIISFSYTRI WS KLKNHVS PGAANDH YH QRRQKTTKMLV 271 

Qy 276 TVWL FAVCWAP FH I VHMM I E Y S N F E KEYDEVTIKMI FAI VQI IGFFNSI CNPI I YA 332 

111:111111 : : : : III I : I I : II :: I I : : I 

Db 272 CWWFAVSWLPLHAFQLAVDIDSQVLDLKEY KLI FTVFHI IAMCSTFANPLLYG 32 6 

Qy 333 LMNENFKKNFVSA 345 8/9=88% identical 

II I : : I I : I I 
Db 327 WMNSNYRKAFLSA 339 



Application/Control Number: 09/990,940 
Art Unit: 1646 



AC AAY94993; 
XX 

DT 19-JUN-2000 (first entry) 

5 xx 

DE Human secreted protein vc38_l, SEQ ID NO: 26. 
XX 

KW Human; secreted protein; cancer; tumour; cardiovascular disorder; 

KW blood disorder; haemophilia; autoimmune disease; diabetes; inflammation; 

10 KW infection; fungal; bacterial; viral; HIV; allergy; arthritis; 

KW neurodegenerative disease; asthma; contraceptive. 
XX 

OS Homo sapiens. 
XX 

15 PN WO200011015-A1. 
XX 

PD 02-MAR-2000. 
XX 

PF 24-AUG-1999; 99WO-US19351 . 

20 xx 

PR 24-AUG-1998; 98US-0097638 . 

PR 24-AUG-1998; 98US-0097659 . 

PR 09-SEP-1998; 98US-0099618 . 

PR 28-SEP-1998; 98US-0102092 . 

25, PR 25-NOV-1998; 98US-0109978 . - - 

PR 23-DEC-1998; 98US-0113645 . 

PR 23-DEC-1998; 98US-0113646 . 

PR 23-AUG-1999; 99US-037 9246 . 
XX 

30 PA (ALPH-) ALPHAGENE INC. 
XX 

PI Valenzuela D, Yuan O, Hoffman H, Hall J, Rapiejko P; 
XX 

DR WPI; 2000-224657/19. 

35 xx 

PT New secreted or transmembrane proteins and polynucleotides encoding 

PT them, useful for treating neurodegenerative disorders, autoimmune 

PT diseases and cancer - 
XX 

40 PS Claim 35; Page 284-285; 357pp; English. 
XX 

CC The invention relates to 40 human secreted proteins (AAY94 981-Y95020 ) , 

CC and cDNA sequences encoding them (AAA23423-A234 62 ) . The secreted 

CC proteins of the invention include those that are thought to be only 

45 CC partially secreted, i.e., transmembrane proteins. The proteins of the 

CC invention may exhibit one or more activities selected from the following: 

CC cytokine activity; cell proliferation; differentiation; immune 

CC modulation; haematopoiesis regulation; tissue growth activity; 

CC activin/inhibin activity; chemotactic/chemokinetic activity; haemostatic 

50 CC and thrombolytic activity; anti-inflammatory activity; and tumour 

CC inhibition activity. The proteins may be administered to patients as 

CC vaccines, and the nucleotides may be used as part of a gene therapy 
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CC regime. Diseases or conditions that may be treated using the proteins or 

CC nucleotides of the invention include autoimmune diseases; genetic 

CC disorders; haemophilia; cardiovascular diseases; cancer; bacterial, 

CC fungal and viral infections, especially HIV; multiple sclerosis; 

CC rheumatoid arthritis; pulmonary inflammation; Guillain-Barre syndrome; 

CC insulin dependent diabetes mellitus; and allergic reactions such as 

CC asthma and anaemia. They may also be used for treating wounds, burns, 

CC ulcers, osteoporosis, osteoarthritis, periodontal diseases, Alzheimer ! s 

CC disease, Parkinson ! s disease, Huntington's disease and amyotrophic 

CC lateral sclerosis (ALS) . Proteins with activin/inhibin activity may 

CC additionally be useful as contraceptives. Nucleic acid sequences of the 

CC invention may be used in chromosome mapping, and as a source of 

CC diagnostic primers and probes. The present sequence represents one of the 

CC 40 proteins of the invention. 

XX 

SQ Sequence 431 AA; 

Query Match 81.8%; Score 1767; DB 21; Length 431; 

Best Local Similarity 80.1%; Pred. No. 1.2e-188; 

Matches 339; Conservative 36; Mismatches 38; Indels 10; Gaps 

NLTAEQLSALLRLHNLTRAQFIAHYGLRPLVLTPQLPARARLALLLVGMLI FALALFGNA 63 

I : I II I III I I I I I I I I I I I II I I Ihll I I : I I I : I I : I I I I I I I I I I I 
NITPEQFSRLLRDHNLTREQFIALYRLRPLVYTPELPGRAKLALVLTGVLI FALALFGNA 64 

LVVYWTRSKAMRTVTNIFICSIALSDLLIW 123 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I 11111111111:11 I I I I I I I I I I I I I I 



QCTAI VTEI LTMTCIAVERHQGLVHPFKMKRQ YTNQRAFTMLGWWLVAI 1 1 GS PMWHVQ 183 
I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I : I : I I I I I I I I 
Q S TAWT E I LTMT C I AVERHQ GLVH P FKMKWQ YTNRRAFTMLGWWLVAVI VGS PMWHVQ 184 

RLEIKYDFLYEKEHICCLEEWSSPVHQKIYTTFILVTLFLLPLLLLSVLYGKIGYELWIK 243 
: I I I I I I I I I II I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I : : : : I I I I I I I I I I I 



KRIGDGSVLRTIHGKEMFKIARKKKRAVIMMVTVWLFAVCWAPFHIVHMMIEYSNFEKE 303 

I I : II I I I I I I I I I I I I I I I I I I II II I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I 

KRVGDGSVLRTIHGKEMSKIARKKKRAVIMMVTWALFAVCWAPFHVAmMMIEYSNFEKE 304 

YDEVTIKMIFAIVQIIGFFNSICNPIIYALMNENFKKNFVSAVCYCIVKETPSSARKHGS 363 

I I : I I I I I I I I I I II I II I I I I II I : I I I I I I I I I I : M I I I I I I : I I I : : I I : 
YDDVTIKMIFAIVQIIGFSNSICNPIVYAFMNENFKKNVLSAVCYCIVNKTFSPAQRHGN 364 

SGAMVMHRRAKLAARENPV-EIKGEAFGGSNIDIKWCEQPEKKKR RSKVA-S 413 

II : I :: I I : I I I I I I I I I I I I I : : I I I I I : I I : I I : : I : 
SGITMMRKKAKFSLRENPVEETKGEAFSDGNIEVKLCEQTEEKKKLKRHLALFRSELAEN 424 



3; 




Ov 


4 


Db 


5 


Qy 


64 


Db 


65 


Qy 


124 


Db 


125 


QY 


184 


Db 


185 


QY 


244 


Db 


245 


Qy 


304 


Db 


305 


Qy 


364 


Db 


365 


Qy 


414 


Db 


425 



1 1 
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AC Q9Z2D5; 

5 DT 30-MAY-2000 (Rel. 39, Created) 

DT 30-MAY-2000 (Rel. 39, Last sequence update) 

DT 30-MAY-2000 (Rel. 39, Last annotation update) 

DE Neuropeptide Y receptor type 2 (NPY2-R) (NPY-Y2 receptor) . 

GN NPY2R. 

10 OS Cavia porcellus (Guinea pig) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Hystricognathi ; Caviidae; Cavia. 

OX NCBI_TaxID=10141; 

RN [1] 

15 RP SEQUENCE FROM N.A. 

RX MEDLINE=99017376; PubMed=9802390; 

RA Sharma P.S., Holmberg S.K., Eriksson H., Beck-Sickinger A.G., 

RA Grundemar L., Larhammar D.; 

RT "Cloning and functional expression of the guinea pig neuropeptide Y 

20 RT Y2 receptor."; 

RL Regul. Pept. 75:23-28(1998). 

CC -!- FUNCTION: RECEPTOR FOR NEUROPEPTIDE Y AND PEPTIDE YY. 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- SIMILARITY: BELONGS TO FAMILY 1 OF G-PROTEIN COUPLED RECEPTORS. 
25 CC HIGHEST TO TACHYKININS RECEPTORS. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a 
collaboration 

30 CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation 

CC the European Bioinf ormatics Institute. There are no restrictions on 
its 

CC use by non-profit institutions as long as its content is in no 
35 way 

CC modified and this statement is not removed. Usage by and for 
commercial 

CC entities requires a license agreement (See http://www.isb- 
sib . ch/announce/ 

40 CC or send an email to license@isb-sib . ch) . 

CC --• 

DR EMBL; AF072821; AAD13143.1; -. 

DR InterPro; IPR000276; GPCR_Rhodpsn . 

45 DR Pfam; PF00001; 7tm_l; 1. 

DR PRINTS; PR00237; GPCRRHODOPSN . 

DR PROSITE; PS00237; G_PROTEIN_RECEP_Fl_l ; 1. 

DR PROSITE; PS50262; G_PROTEIN_RECEP_Fl_2 ; 1. 

KW G-protein coupled receptor; Transmembrane; Glycoprotein; 

50 KW Phosphorylation; Lipoprotein; Palmitate. 
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FT 


DOMAIN 


1 


50 


nvmn n pt?t t tit t\ n / ti /"\m r>\Tm t t\ t \ 

EXTRACELLULAR (POTENTIAL) . 


FT 


TRANSMEM 


51 


73 


1 / nrtfTlDVTmT TV T \ 

1 (POTENTIAL) . 


FT 


DOMAIN 


74 


83 


CYTOPLASMIC (POTENTIAL) . 


FT 


TRANSMEM 


84 


105 


z (POTENTIAL) . 


FT 


DOMAIN 


106 


125 


EXTRACELLULAR (POTENTIAL) . 


FT 


TRANSMEM 


126 


147 


O / nnmrxTrnT tv t \ 

3 (POTENTIAL) . 


FT 


DOMAIN 


148 


167 


CYTOPLASMIC (POTENTIAL) . 


FT 


TRANSMEM 


168 


18 8 


A 1 nr\(T1T7VTfT1T TV T \ 

4 (POTENTIAL) . 


FT 


DOMAIN 


189 


215 


EXTRACELLULAR (POTENTIAL) . 


FT 


TRANSMEM 


216 


241 


5 (POTENTIAL) . 


FT 


<p— n rt\ jf -ft tit 

DOMAIN 


242 


o n 

2 69 


CYTOPLASMIC (POTENTIAL) . 


FT 


TRANSMEM 


270 


292 


6 (POTENTIAL) . 


FT 


DOMAIN 


293 


305 


EXTRACELLULAR (POTENTIAL) . 


FT 


TRANSMEM 


306 


329 


7 (POTENTIAL). 


FT 


DOMAIN 


330 


381 


CYTOPLASMIC (POTENTIAL) . 


FT 


CARBOHYD 


11 


11 


N-LINKED ( GLCNAC . . .) (POTENTIAL 


FT 


DISULFID 


124 


204 


BY SIMILARITY. 


FT 


LIPID 


343 


343 


PALM I TATE (POTENTIAL) . 


SQ 


SEQUENCE 


381 AA; 


42342 


MW; D806B859A43ECACE CRC64; 



Query Match 20.9%; Score 452; DB 1; Length 381; 

Best Local Similarity 31.6%; Pred. No. 4.5e-25; 

Matches 99; Conservative 74; Mismatches 106; Indels 34; Gaps 



Qy 44 RLALLLVGMLIFALALFGNALVVYVVTRSKAMRTW 103 

I : I : I I I : I I : I I : : I I : I : I I I I I I I I : I I : : I I I : I : I I : 

Db 51 RWL I LAYC S 1 1 L L G WGN S L VI HWI K F K S MRT VT N F F I AN LAVAD L L VN TLCLPFTLT 110 



Qy 104 QNVSDTWLGGAFICKMVPFVQCTAIVTEILTMTCIAVERHQGLVHPFKMKRQYTNQRAFT 163 

: I I : I : I I : I I : : I : I I I : : I I : : I : : : : I : I 
Db 111 YTLMGEWKMGPVLCHLVPYAQGLAVQVSTVTLTVIALDRHRCIV— YHLDSKISKQNSFL 168 

Qy 164 MLGWWLVAIIIGSPM WHVQRLEIKYDFLYEKEHICCLEEW SSPVHQKI YTTFI 217 

: : I : I : : : : I I : : I I I I I : I I : I : : : I : 

Db 169 IIGLAWGISALLASPLAIFREYSLIEIIPDF EIVACTEKWPGEEKSIYGTVYSLSS 224 



Qy 218 LVTLFLLPLLLLS VL YGKI GYELW — I KKRI GDGS VLRT I HGKEMFKI ARKKKRAVIMMV 275 

I : I : : I II : : I I I : I I : I : I : I : : : : : hi 

Db 225 LLILYVLPLGIISVSYVRI WSKLKNHVSPGAANDHYH QRRQKTTKMLV 272 

Qy 276 T VWL FAVCWAP FH I VHMMI E Y S N FE KEYDEVTIKMIFAIVQIIGFFNSICNPIIYA 332 

I I I : I I I I I I : : : : III I : I I : II : : I I : : I 

Db 273 FWWFAVS WL P LHAFQ LAVD I DS QVLDLKE Y KLI FTVFHIIAMCSTFANPLLYG 327 



Qy 333 LMNENFKKNFVSA 345 

II I : : I I : I I 
Db 32 8 WMNSNYRKAFLSA 34 0 



ID NY2R_BOVIN STANDARD; PRT; 384 AA. 

AC P79113; 

DT 01-NOV-1997 (Rel. 35, Created) 

DT 01-NOV-1997 (Rel. 35, Last sequence update) 

DT 30-MAY-2000 (Rel. 39, Last annotation update) 
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DE Neuropeptide Y receptor type 2 (NPY2-R) (NPY-Y2 receptor) . 

GN NPY2R. 

OS Bos taurus (Bovine) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Cetartiodactyla; Ruminantia; Pecora; Bovoidea; 

OC Bovidae; Bovinae; Bos. 

OX NCBI_TaxID=9913; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Ammar D.A. , Kolakowski L.F. Jr., Eadie D.M., Wong D.J., Ma Y.Y., 

RA Yang-Feng T.L., Thompson D.A.; 

RL Submitted (JAN-1997) to the EMBL/ GenBank/DDBJ databases. 

CC -!- FUNCTION: RECEPTOR FOR NEUROPEPTIDE Y AND PEPTIDE YY. 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- SIMILARITY: BELONGS TO FAMILY 1 OF G-PROTEIN COUPLED RECEPTORS. 
CC HIGHEST TO TACHYKININS RECEPTORS. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a 
collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation 

CC the European Bioinf ormatics Institute. There are no restrictions on 
its 

CC use by non-profit institutions as long as its content is in no 
way 



CC 


modified 


and this 


statement 


is not removed. 


Usage by 


commercial 










CC 


entities 


requires 


a license 


agreement (See 


http : //www. i 


sib . 


ch/announce/ 








CC 
CC 

DR 


or send an email 


to license@isb-sib . ch) . 




EMBL; U50144; AAB40600.1; -. 






DR 


InterPro; 


IPR000276; GPCR_Rhodpsn . 




DR 


Pfam; PF00001; 7tm 1; 1. 






DR 


PRINTS; PR00237; 


GPCRRHODOPSN. 




DR 


PROSITE; 


PS00237; 


G_PROTEIN_ 


RECEP_F1_1; 1. 




DR 


PROSITE; 


PS50262; 


G PROTEIN_ 


RECEP_F1_2; 1. 




KW 


G-protein 


coupled 


receptor; 


Transmembrane ; 


Glycoprotein 


KW 


Phosphorylation; 


Lipoprotein; Palmitate. 




FT 


DOMAIN 


1 


52 


EXTRACELLULAR 


(POTENTIAL) . 


FT 


TRANSMEM 


53 


75 


1 (POTENTIAL) . 




FT 


DOMAIN 


76 


85 


CYTOPLASMIC (POTENTIAL) . 


FT 


TRANSMEM 


86 


107 


2 (POTENTIAL) . 




FT 


DOMAIN 


108 


127 


EXTRACELLULAR 


(POTENTIAL) . 


FT 


TRANSMEM 


128 


149 


3 (POTENTIAL) . 




FT 


DOMAIN 


150 


169 


CYTOPLASMIC (POTENTIAL) . 


FT 


TRANSMEM 


170 


190 


4 (POTENTIAL) . 




FT 


DOMAIN 


191 


217 


EXTRACELLULAR 


(POTENTIAL) . 


FT 


TRANSMEM 


218 


243 


5 (POTENTIAL) . 




FT 


DOMAIN 


244 


271 


CYTOPLASMIC (POTENTIAL) . 


FT 


TRANSMEM 


272 


294 


6 (POTENTIAL) . 




FT 


DOMAIN 


295 


307 


EXTRACELLULAR 


(POTENTIAL) . 
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FT 


TRANSMEM 


308 


331 


7 (POTENTIAL) . 


FT 


DOMAIN 


332 


384 


CYTOPLASMIC (POTENTIAL) . 


FT 


CARBOHYD 


13 


13 


N-LINKED (GLCNAC. . .) (POTENTIAL 


FT 


DISULFID 


126 


206 


BY SIMILARITY. 


FT 


LIPID 


345 


345 


PALM I TATE (POTENTIAL) . 


SQ 


SEQUENCE 


384 AA; 


42943 


MW; 468D19CBA8F29681 CRC64 ; 



Query Match 20.8%; Score 449; DB 1; Length 384; 

Best Local Similarity 31.6%; Pred. No. 7.4e-25; 

Matches 99; Conservative 74; Mismatches 106; Indels 34; Gaps 



Qy 44 RLALLLVGMLI FAI^LFGNALWYWTRS KAMRTVTNI FI CSLALSDLLIVFFCI PVTML 103 

: : I : I I I : I I : I I : : I I : I : I I I I I I I I : I I : : I I I : hi h 

Db 53 QWLI LAYCS 1 1 LLGVI GNS LVI HWT KFKSMRTVTNFFIANLAVADLLVNTLCLPFTLT 112 



Qy 104 QNVSDTWLGGAFICKMVPFVQCTAIVTEILTMTCIAVERHQGLVHPFKMKRQYTNQRAFT 163 

: I I : I : I I : I I : : I : I I I : : I I : : I : : : : : I : I 
Db 113 YTLMGEWKMGPVLCHLVPYAQGLAVQVSTITLTVIALDRHRCIV— YHLESKI SKQISFL 170 

Qy 164 MLGWWLVAIIIGSPM WHVQRLEIKYDFLYEKEHICCLEEW SSPVHQKIYTTFI 217 

: : I : I ( : : : I I : : I I II I : I I : I : : I I : 

Db 171 1 1 GLAWGVSALLAS PLAI FREYSLI EI I PDF EIVACTEKWPGEEKGIYGTIYSLSS 226 



Qy 218 LVTLFLLPLLLLSVLYGKIGYELW — I KKRI GDGSVLRTI HGKEMFKIARKKKRAVIMMV 275 

I : I : : M I : : I I : I I : I : I : I : : : : : I : I 

Db 227 LLILYVLPLGIISFSYTRI WSKLKNHVSPGAAHDHYH QRRQKTTKMLV 274 



Qy 276 TVWLFAVCWAP FH I VHMMI E YSN FE KEYDEVTIKMI FAIVQIIGFFNSICNPI I YA 332 

I I I : I II I I I : : : : III I : I I : II : : I I : : I 

Db 275 CWWFAVSWLPLHAFQLAVDIDSHVLDLKEY KLI FTVFHIIAMCSTFANPLLYG 329 



Qy 333 LMNENFKKNFVSA 345 

II I :: I I : I I 
Db 330 WMNSNYRKAFLSA 342 



neuropeptide Y/peptide YY receptor Y2 - human 

N; Alternate names: neuropeptide y/peptide YY receptor type 2 

C; Species: Homo sapiens (man) 

C;Date: Ol-Mar-1996 #sequence_revision Ol-Mar-1996 #text_change 20-Apr-2000 
C;Accession: 139187; 139163; G02301 
R;Gerald, C. ; Walker, M.W. ; Vaysse, P.J. 
J. Biol. Chem. 270, 26758-26761, 1995 

A; Title: Expression cloning and pharmacological characterization of a human 

hippocampal neuropeptide Y/peptide YY Y2 receptor subtype. 

A; Reference number: 139187; MUID : 96070760 ; PMID: 7592910 

A; Accession: 139187 

A; Status: preliminary 

A;Molecule type: mRNA 

A; Residues: 1-381 <GER> 

A;Cross-references: EMBL:U36269; NID : gl063633 ; PIDN : AAC502 81 . 1 ; PID:gl063634 
R;Rose, P.M.; Fernandes, P.; Lynch, J.S.; Frazier, S.T.; Fisher, S.M.; 
Kodukula, K. ; Kienzle, B.; Seethala, R. 
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; Patent No. 5989834 
; GENERAL INFORMATION: 

; APPLICANT: Synaptic Pharmaceutical Corporation 

TITLE OF INVENTION: NUCLEIC ACID ENCODING NEUROPEPTIDE 
TITLE OF INVENTION: Y/PEPTIDE YY (Y2) RECEPTORS AND USES THEREOF 
NUMBER OF SEQUENCES: 27 
CORRESPONDENCE ADDRESS: 
; ADDRESSEE: Cooper & Dunham LLP 

; STREET: 1185 Avenue of the Americas 

; CITY: New York 

; STATE: New York 

; COUNTRY: U.S.A. 

; ZIP: 10036 

COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 
; SOFTWARE: Patentln Release #1.0, Version #1.30 

; CURRENT APPLICATION DATA: 

; APPLICATION NUMBER: US/08/687, 355A 

; FILING DATE: No. 5989834ember 26, 1996 

CLASSIFICATION: 435 
ATTORNEY/AGENT INFORMATION: 
NAME: White, John P. 
REGISTRATION NUMBER: 28,678 
; REFERENCE/DOCKET NUMBER: 447 42-A-PCT/ JPW/MAT 

TELECOMMUNICATION INFORMATION: 
TELEPHONE: 212-278-0400 
; TELEFAX: 212-391-0525 

; INFORMATION FOR SEQ ID NO: 6: 
; SEQUENCE CHARACTERISTICS: 

; LENGTH: 381 amino acids 

; TYPE: amino acid 

; TOPOLOGY: linear 

; MOLECULE TYPE: protein 
US-08-687-355A-6 

Query Match 21.1%; Score 454.5; DB 2; Length 381; 

Best Local Similarity 30.1%; Pred. No. 3.7e-33; 

Matches 104; Conservative 75; Mismatches 119; Indels 47; Gaps 
10; 

Qy 25 IAHYGLRPLVLTPQLP ARARLAL L L VGML I FALALFGNALWYWTR 71 

: I I I : I I : : I : I I I : I I : I I : : I I : 

Db 18 VEFYGSGPTTPRGELPPDPEPELIDSTKLVEVQWLILAYCSIILLGWGNSLVI HWIK 77 

Qy 72 SKAMRTVTNI FI CS LALS DLLI VFFCI PVTMLQNVS DTWLGGAFI CKMVP FVQCTAI VTE 131 

I : I I I I I I I I : I I : : I I I : hi h : I I : I : I I : I I : 
Db 78 FKSMRTVTNFFIANLAVADLLVNTLCLPFTLTYTLMGEWKMGPVLCHLVPYAQGLAVQVS 137 

Qy 132 ILTMTCIAVERHQGLVHPFKMKRQYTNQRAFTMLGWWLVAIIIGSPM WHVQRLEIK 188 

: I : I I I : : I I : : I : : : : : I : I : : I : I I : : : I I : :|| 
Db 138 TIT LT VI AL D RH RC I V — YHLESKI SKQI S FLI IGLAWGVSALLAS PLAI FREYSLI EI I 195 
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Qy 18 9 YDFLYEKEHICCLEEW S S PVHQKI YTT FI LVTL FLLPLLLLS VLYGKI GYELW — IK 243 

II I : I I : I I : : I : I : I : : I I I : : I I : I I : I 
Db 196 PDF EIVACTEKWPGEEKSVYGTVYSLSTLLILYVLPLGIISFSYTRI WSKLK 247 

5 Qy 244 KRIGDGSVLRTIHGKEMFKIARKKKRAVIMMVTVWLFAVCWAPFHIVHMMIEYSNFE — 301 

: I : I : : : : I : I I I I : I I I I I I : : : : 

Db 248 NHVSPGAASDHYH QRRHKMTKMLVCWWFAVS WL P LHAFQLAVD IDS HVLD 299 

Qy 302 -KEYDEVTIKMIFAIVQIIGFFNSICNPIIYALMNENFKKNFVSA 345 

10 Ml I : I I : II : : I I : : I I I I : : I I : I I 

Db 300 LKEY KLI FTVFHI IAMCSTFANPLLYGWMNSNYRKAFLSA 339 



15 
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; Sequence 2, Application US/09899532 

; Patent No. US2002 004 87 91A1 

; GENERAL INFORMATION: 

; APPLICANT: Bloomquist, Brian T. 

; APPLICANT: Zhelnin, Leonid 

; TITLE OF INVENTION: Human Neuropeptide Y-Like G 

TITLE OF INVENTION: Protein-Coupled Receptor 

; FILE REFERENCE: 02973.00040 

; CURRENT APPLICATION NUMBER: US/09/8 99,532 

; CURRENT FILING DATE: 2001-07-06 

; PRIOR APPLICATION NUMBER: US 60/216,523 

; PRIOR FILING DATE: 2000-07-06 

; NUMBER OF SEQ ID NOS : 3 

; SOFTWARE: FastSEQ for Windows Version 4.0 

; SEQ ID NO 2 
; LENGTH: 431 
; TYPE: PRT 

; ORGANISM: Homo sapiens 
US-09-899-532-2 



Query Match 81.8%; Score 1767; DB 9; Length 431; 

Best Local Similarity 80.1%; Pred. No. 7.3e-156; 

Matches 339; Conservative 36; Mismatches 38; Indels 10; Gaps 



Qy 4 NLTAEQLSALLRLHNLTRAQFIAHYGLRPLVLTPQLPARARLALLLVGMLIFALALFGNA 63 

I : I II I III I I I I I I I I I I I I I I I Ihll I I : I I I : I I : I M I I I I I I I I 

Db 5 N I T P EQ FS RL LRDHNLT REQ FI AL YRL RP LVYT PEL P GRAKLALVLT GVL I FALAL FGNA 64 

Qy 64 LWYWTRSKAMRTVTNIFICSIALSDLLIVFFCIPVTMLQNVSDTWLGGAFICKMVPFV 123 

II I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I : I I I I II I I I I I I I I I I 

Db 65 LVFY\A/TRSKAMRTVTNIFICSLALSDLLITFFCIPVTMLQNISDNWLGGAFICKMVPFV 124 



Qy 124 QCTAIVTEILTMTCIAVERHQGLVHPFKMKRQYTNQRAFTMLGVWLVAIIIGSPMWHVQ 183 

I I I : I I I I I I I I I II I I I I I I I I I I II M I I I I : I I I I I I I I I I I I I : I : I M I I I I I 

Db 125 QSTAWTEILTMTCIAVERHQGLVliPFKMKWQYTNRRAFTMLGvWLVAVIVGSPMWHVQ 184 

Qy 184 RLEI KYDFLYEKEHI CCLEEWS S PVHQKI YTTFI LVTLFLLPLLLLSVLYGKI GYELWI K 243 

: I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I : : : : I I I I I I II I I I 

Db 185 QLEI KYDFLYEKEHI CCLEEWTS PVHQKI YTTFI LVILFLLPLMVMLILYSKI GYELWI K 244 

Qy 244 KRIGDGSVLRTIHGKEMFKIARKKKRAVIMMVTVVVLFAVCWAP FHIVHMMIEYSNFEKE 303 

I I : II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I II I I 

Db 245 KRVGDGSVLRTIHGKEMSKIARKKKRAVIMMVTWALFAVCWAP FHWHMMIEYSNFEKE 304 



Qy 304 YDEVTIKMIFAIVQIIGFFNSICNPIIYALMNENFKKNFVSAVCYCIVKETPSSARKHGS 363 

I I : I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I : I I I II I I I : I I I : : I I : 
Db 305 YDDVT I KMI FAI VQI I GFSNS I CNPI VYAFMNENFKKNVLSAVCYCI VNKT FS PAQRHGN 364 



Qy 364 SGAMVMHRRAKLAARENPV-EIKGEAFGGSNIDIKWCEQPEKKKR RSKVA-S 413 

II : I : : I I : I I I I I I I I I I I I I : : I I I I I : M : I I : : I : 

Db 365 SGI TMMRKKAKFS LRENPVEETKGEAFSDGNI EVKLCEQTEEKKKLKRHLALFRS ELAEN 424 
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RESULT 5 

US-10-272-983-38 

; Sequence 38, Application US/10272983 

; Publication No. US20030148450A1 

; GENERAL INFORMATION: 

; APPLICANT: Chen, Ruoping 

; APPLICANT: Dang, Huong T. 

; APPLICANT: Liaw, Chen W. 

; APPLICANT: Lin, I-Lin 

; TITLE OF INVENTION: Human Orphan G Protein Coupled Receptors 
; FILE REFERENCE: AREN0050 

; CURRENT APPLICATION NUMBER: US/10/272 , 983 

; CURRENT FILING DATE: 2002-10-17 

; PRIOR APPLICATION NUMBER: US/09/417 , 044 

; PRIOR FILING DATE: 1999-10-12 

; PRIOR APPLICATION NUMBER: 60/109,213 

; PRIOR FILING DATE: 1998-11-20 

; PRIOR APPLICATION NUMBER: 60/120,416 

; PRIOR FILING DATE: 1999-02-16 

; PRIOR APPLICATION NUMBER: 60/121,851 

; PRIOR FILING DATE: 1999-02-26 

; PRIOR APPLICATION NUMBER: 60/123,946 

; PRIOR FILING DATE: 1999-03-12 

; PRIOR APPLICATION NUMBER: 60/123,949 

; PRIOR FILING DATE: 1999-03-12 

; PRIOR APPLICATION NUMBER: 60/136,436 

; PRIOR FILING DATE: 1999-05-28 

; PRIOR APPLICATION NUMBER: 60/136,437 

; PRIOR FILING DATE: 1999-05-28 

; PRIOR APPLICATION NUMBER: 60/136,439 

; PRIOR FILING DATE: 1999-05-28 

; PRIOR APPLICATION NUMBER: 60/136,567 

; PRIOR FILING DATE: 1999-05-28 

; Remaining Prior Application data removed - See File Wrapper or PALM. 

; NUMBER OF SEQ ID NOS : 7 4 

; SOFTWARE: Patentln Ver. 2.1 

; SEQ ID NO 38 

LENGTH: 431 

TYPE: PRT 
; ORGANISM: Homo sapiens 
US-10-272-983-38 

Query Match 81.7%; Score 1764; DB 12; Length 431; 

Best Local Similarity 79.9%; Pred. No. 1.4e-155; 

Matches 338; Conservative 37; Mismatches 38; Indels 10; Gaps 

3; 

Qy 4 NLTAEQL SALLRLHNLT RAQ FI AH YGLRPLVLT PQL PARARLALLLVGMLI FALAL FGNA 63 

I : I II I III I I I I I I II I I I I I I I I I : I I I I : I I I : I I : I I I I I I I I I I I 

Db 5 NITPEQFSRLLRDHNLTREQFIALYRLRPLVYTPELPGRAKLALVLTGVLIFALALFGNA 64 

Qy 64 LVVYvWRSKAMRTVTNIFICSLALSDLLIVFFCIPVTMLQNVSDTWLGGAFICKMVPFV 123 

II I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I 
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Db 65 LVFYVVTRSKAMRTWNIFICSLALSDLLITFFCIPVTMLQNISDNWLGGAFICKMVPFV 124 

Qy 124 QCTAI VTEI LTMTCIAVERHQGLVHPFKMKRQ YTNQRAFTMLGVVWLVAI 1 1 GS PMWHVQ 183 

I I I : I I I : I I I I I I I I I I I I I II I I I I I I I I I I : I I I I I I I I I I I I I : I : I I I I I I I I 
Db 125 QSTAVWEMLTMTCIAVERHQGLVHPFKMKWQYTNRRAFTMLGVVWLVAVIVGS PMWHVQ 184 

Qy 184 RLEIKYDFLYEKEHICCLEEWSSPVHQKIYTTFILVTLFLLPLLLLSVLYGKIGYELWIK 243 

: I I I I I I I I I I I I I I I I I I M : I I I I I I I I I I I I I I I I I I I I : :: : I I I M I I I I I I 
Db 185 QLEIKYDFLYEKEHICCLEEWTSPVHQKIYTTFILVILFLLPLMVMLILYSKIGYELWIK 244 

Qy 244 KRIGDGSVLRTIHGKEMFKIARKKKRAVII^VTVWLFAVCWAPFHIVHMMIEYSNFEKE 303 

I I : I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I : I II I I I I I I I I I I 
Db 245 KRVGDGSVLRTIHGKEMSKIARKKKRAVIMIWTVVALFAVCWAPFHVVHMMIEYSNFEKE 304 

Qy 304 YDEVTIKMIFAIVQIIGFFNSICNPIIYALMNENFKKNFVSAVCYCIVKETPSSARKHGS 363 

I I : I I I I I I I I I I I I I I I I I I I I I I : I I I I I II I I I : I I I I II I I : I I I : : I I : 

Db 305 YDDVT I KMI FAIVQI I GFSNS I CNPI VYAFMNENFKKNVLSAVCYCI VNKTFS PAQRHGN 364 

Qy 364 S GAMVMHRRAKLAARENPV- EI KGEAFGGSNI DI KWCEQPEKKKR RSKVA-S 413 

II : I : : I I : I I I I I I I I I I I I I : : I I I I I : I I : I I : : I : 

Db 365 SGITMMRKKAKFSLRENPVEETKGEAFSDGNIEVKLCEQTEEKKKLKRHLALFRSELAEN 424 

Qy 414 CPL 416 

I I 

Db 425 SPL 427 



LOCUS 
2002 

DEFINITION 



ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 
MEDLINE 
PUBMED 

REFERENCE 
AUTHORS 

TITLE 

genes 

JOURNAL 



AK048439 



2864 bp 



mRNA 



linear HTC 05-DEC- 



Mus musculus 16 days embryo head cDNA, RIKEN full-length enriched 
library, clone : C130060K24 product : hypothetical Rhodopsin-like GPCR 
superfamily containing protein, full insert sequence. 
AK048439 

AK048439. 1 GI: 26339331 
HTC; CAP trapper. 
Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 
1 

Carninci,P. and Hayashizaki, Y. 

High-efficiency full-length cDNA cloning 

Meth. Enzymol. 303, 19-44 (1999) 

99279253 

10349636 

2 

Carninci,P., Shibata,Y., Hayatsu,N., Sugahara,Y., Shibata,K., 
Itoh,M., Konno,H., Okazaki,Y., Muramatsu,M. and Hayashizaki , Y . 
Normalization and subtraction of cap-trapper-selected cDNAs to 
prepare full-length cDNA libraries for rapid discovery of new 

Genome Res. 10 (10), 1617-1630 (2000) 
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MEDLINE 
PUBMED 
REFERENCE 

AUTHORS 
Carninci, P . 



Kawai, J. , 

TITLE 

JOURNAL 
MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 



20499374 
11042159 
3 

Shibata,K., Itoh,M., Aizawa,K., Nagaoka,S., Sasaki, N., 

Konno,H., Akiyama,J., Nishi,K., Kitsunai,T., Tashiro,H., Itoh,M. , 
Sumi,N., Ishii,Y., Nakamura, S . , Hazama,M. , Nishine,T., Harada,A. , 
Yamamoto, R. , Matsumoto, H . , Sakaguchi , S . , Ikegami,T., Kashiwagi, K. , 
Fujiwake,S., Inoue,K. , Togawa,Y., Izawa,M. , Ohara,E., Watahiki,M., 
Yoneda,Y., Ishikawa,T., Ozawa,K., Tanaka,T., Matsuura,S., 

Okazaki,Y., Muramatsu,M. , Inoue,Y., Kira,A. and Hayashizaki , Y. 

RIKEN integrated sequence analysis (RISA) system — 384-format 

sequencing pipeline with 384 multicapillary sequencer 

Genome Res. 10 (11), 1757-1771 (2000) 

20530913 

11076861 

4 

Kawai , J. , 
Arakawa, T , 
Aizawa, K. , 
Saito,T. , 



Nordone, P . , 



Suzuki, H. , 



TITLE 
JOURNAL 
MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 

TITLE 

JOURNAL 
REFERENCE 

AUTHORS 
Carninci, P . 



Shinagawa, A. , Shibata,K., Yoshino,M., Itoh,M., Ishii,Y., 
, Hara,A., Fukunishi, Y. , Konno,H., Adachi,J., Fukuda,S., 
Izawa,M., Nishi,K., Kiyosawa,H., Kondo,S., Yamanaka,I., 
Okazaki,Y., Gojobori,T., Bono,H., Kasukawa,T., Saito,R., 



Kadota,K., Matsuda,H., Ashburner , M. , Batalov,S., Casavant,T., 
Fleischmann, W. , Gaasterland, T . , Gissi,C, King,B., Kochiwa,H., 
Kuehl,P., Lewis, S., Matsuo,Y., Nikaido,I., Pesole,G., 
Quackenbush, J. , Schriml , L.M. , Staubli,F., Suzuki, R. , Tomita,M., 
Wagner, L. , Washio,T., Sakai,K., Okido,T., Furuno,M., Aono,H., 
Baldarelli, R. , Barsh,G., Blake, J., Boffelli,D., Bojunga,N., 
Carninci, P., de Bonaldo, M. F . , Brownstein, M. J. , Bult,C, 
Fletcher, C, Fujita,M., Gariboldi,M. , Gus tincich, S . , Hill,D., 
Hofmann,M., Hume, D. A., Kamiya,M., Lee, N . H . , Lyons, P., 
Marchionni, L. , Mashima, J. , Mazzarelli, J. , Mombaerts, P . , 

Ring,B., Ringwald,M., Rodriguez , I . , Sakamoto,N., Sasaki, H., 
Sato,K., Schonbach, C. , Seya,T., Shibata,Y., Storch,K.F., 

Toyo-oka,K., Wang,K.H., Weitz,C, Whittaker , C . , Wilming,L., 
Wynshaw-Boris, A. , Yoshida,K., Hasegawa,Y., Kawaji,H., Kohtsuki,S. 
and Hayashizaki, Y. 

Functional annotation of a full-length mouse cDNA collection 

Nature 409 (6821), 685-690 (2001) 

21085660 

11217851 

5 

The FANTOM Consortium and the RIKEN Genome Exploration Research 
Group Phase I & II Team. 

Analysis of the mouse transcriptome based on functional annotation 
of 60,770 full-length cDNAs 
Nature 420, 563-573 (2002) 
6 (bases 1 to 2864) 

Adachi,J., Aizawa, K., Akimura,T., Arakawa, T., Bono,H., 

Fukuda,S., Furuno,M., Hanagaki,T., Hara,A., Hashizume, W. , 
Hayashida, K. , Hayatsu,N., Hiramoto,K., Hiraoka,T., Hirozane,T., 
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TITLE 
JOURNAL 



Yokohama, 



COMMENT 



FEATURES 

source 



library" 
CDS 



Hori,F., Imotani,K., Ishii,Y., Itoh,M., Kagawa,I., Kasukawa,T., 
Katoh,H., Kawai,J., Kojima,Y., Kondo,S., Konno,H., Kouda,M. , 
Koya,S., Kurihara,C, Matsuyama, T . , Miyazaki,A., Murata,M., 
Nakamura, M. , Nishi,K., Nomura, K. , Numazaki,R., Ohno,M. , Ohsato,N., 
Okazaki,Y., Saito,R., Saitoh, H., Sakai,C, Sakai,K., Sakazume,N., 
Sano,H., Sasaki, D., Shibata,K., Shinagawa,A. , Shiraki,T., 
Sogabe,Y., Tagami,M., Tagawa,A. , Takahashi, F. , Takaku-Akahira, S . , 
Takeda,Y., Tanaka,T., Tomaru,A. , Toya,T., Yasunishi , A. , 
Muramatsu,M. and Hayashizaki, Y. 
Direct Submission 

Submitted ( 16- JUL-2001 ) Yoshihide Hayashizaki, The Institute of 
Physical and Chemical Research (RIKEN) , Laboratory for Genome 
Exploration Research Group, RIKEN Genomic Sciences Center (GSC) , 
RIKEN Yokohama Institute; 1-7-22 Suehiro-cho, Tsurumi-ku, 

Kanagawa 230-0045, Japan ( E-mail : genome-res @gsc . riken . go . jp, 
URL :http: //genome. gsc. riken. go. jp/, Tel : 81-45-503-9222 , 
Fax:81-45-503-9216) 

cDNA library was prepared and sequenced in Mouse Genome 
Encyclopedia Project of Genome Exploration Research Group in Riken 
Genomic Sciences Center and Genome Science Laboratory in RIKEN. 
Division of Experimental Animal Research in Riken contributed to 
prepare mouse tissues. 

Please visit our web site for further details. 
URL:http: //genome . gsc . riken. go. jp/ 
URL: http : //fantom. gsc. riken. go. jp/ . 

Location/ Qualifiers 

1. .2864 

/organism =,, Mus musculus" 

/mol_type="mRNA" 

/strain="C57BL/6J" 

/db_xref= n FANTOM_DB:C130060K24" 

/db_xref="taxon: 10090" 

/clone="C130060K24" 

/ tissue_type="head n 

/clone_lib="RIKEN full-length enriched mouse cDNA 

/dev_stage="16 days embryo" 
64. .1161 

/note="unnamed protein product; hypothetical 
Rhodopsin-like GPCR superfamily containing protein 
(InterProl IPR000276, evidence: InterPro) 
putative" 
/codon_start=l 
/protein_id="BAC33337 . 1" 
/db xref="GI:26339332" 



/ trans la tion="MLIFAIALFGNALVVYVWRSPCAMRTVTNIFICSLALSDLLIVF 



FCIPVTMLQNVSDTWLGGAFICKMVPFVQCTAIVTEILTMTCIAVERHQGLVHPFKMK 
RQYTNQRAFTMLGWWLVAI I IGS PMWHVQRLEI KYDFLYEKEHI CCLEEWSS PVHQK 
IYTTFILVTLFLLPLLLLSVLYGKIGYELWIKKRIGDGSVLRTIHGKEMFKIARKKKR 
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AVIMMVTVWLFAVCWAPFHIWMMIEYSNFEKEYDEVTIKMIFAIVQIIGFFNSICN 

PIIYALMNENFKKNFVSAVCYCIVKETPSSARKHGSSGAMVMHRRAKIAARENPVEIK 
5 GEAFGGSN I D I KWCEQ P EKKKRRS KVAS CPL" 

BASE COUNT 839 a 609 c 602 g 814 t 

ORIGIN 

Query Match 94.9%; Score 1681; DB 11; Length 2864; 

10 Best Local Similarity 100.0%; Pred. No. 0; 

Matches 1681; Conservative 0; Mismatches 0; Indels 0; Gaps 

0; 

Qy 92 GGCCACTGGTGCTCACCCCGCAGCTTCCCGCGCGCGCCAGGCTGGCCCTCCTGCTGGTCG 151 

15 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I 

Db 2 GGCCACTGGTGCTCACCCCGCAGCTTCCCGCGCGCGCCAGGCTGGCCCTCCTGCTGGTCG 61 

Qy 152 GCATGCTCATCTTTGCCCTGGCGCTCTTCGGCAACGCCCTGGTAGTCTATGTGGTGACCC 211 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 
20 Db 62 GCATGCTCATCTTTGCCCTGGCGCTCTTCGGCAACGCCCTGGTAGTCTATGTGGTGACCC 121 

Qy 212 G CAGCAAGGC C AT G C GCAC C GT CAC CAACAT CTT CAT CTGCTCCCTG GC ACT CAGC GAC C 271 

I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I 
Db 122 GCAGCAAGGCCATGCGCACCGTCACCAACATCTTCATCTGCTCCCTGGCACTCAGCGACC 181 

25 

Qy 272 TGCTCATCGTCTTCTTCTGCATCCCGGTCACCATGCTCCAGAACGTCTCGGACACCTGGC 331 

I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 182 TGCTCATCGTCTTCTTCTGCATCCCGGTCACCATGCTCCAGAACGTCTCGGACACCTGGC 241 

30 Qy 332 TGGGGGGTGCCTT CAT T T GCAAAAT GGT C C CATT T GT CCAGT GCACT GC CATT GT GACAG 3 91 

I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I 
Db 242 T G G GGGGT G C CT T CAT T T GCAAAAT G GT C C CATT T GT CCAGT GCACT GC CATT GT GACAG 301 

Qy 392 AAATCCTTACTATGACCTGCATTGCTGTGGAAAGGCACCAGGGACTTGTCCATCCTTTTA 4 51 

35 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 302 AAATCCTTACTATGACCTGCATTGCTGTGGAAAGGCACCAGGGACTTGTCCATCCTTTTA 361 

Qy 4 52 AAAT GAAGC GGCAGT ACAC CAAT CAAAGAG CT TT CACAAT GC T AG GT GT GGT GT G G CT GG 511 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
40 Db 362 AAAT GAAGC G GCAGT ACAC CAAT CAAAGAG CTTT CACAAT GCT AG GT GT GGT GT G G CT GG 421 

Qy 512 T G GC CAT CAT CATAGGAT CAC C CAT GT GG CAT GT G CAGC GACT T GAGAT T AAGT AT GACT 571 

I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
Db 422 T GGCCAT CAT CATAGGATCACCCATGT GGCAT GT GCAGCGACTT GAGATTAAGTAT GACT 4 81 

45 

Qy 572 T C CT AT AT GAAAAAGAACACAT CT GCT GC C T G GAAGAGT G GAGC AGC CC C GT GCAC CAGA 631 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 4 82 T C CT AT AT GAAAAAGAACACAT CTGCTGCCTG GAAGAGT G GAG C AG C C C C GT G CAC CAGA 541 

50 Qy 632 AGATCTACACCACCTTCATCCTTGTCACCCTCTTCCTGCTACCACTGTTGCTGCTCTCTG 691 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 542 AGATCTACACCACCTTCATCCTTGTCACCCTCTTCCTGCTACCACTGTTGCTGCTCTCTG 601 



Qy 



692 



T C CT CT AC G G GAAAAT C G GT TAT GAG CTT T G GAT CAAGAAAAGAAT C GG G GAT GGCT CAG 751 
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Db 602 T C C T CT AC GG GAAAAT C G GT T AT GAGCTT T G GAT CAAGAAAAGAAT C G G G GAT GG C T CAG 661 

Qy 752 T GCT C CGAACTAT T CAT GGAAAAGAAATGTT CAAAATAGC CAGAAAGAAGAAGCGAGCT G 811 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I 
Db 662 T GCT C CGAACTATT CAT GGAAAAGAAATGTT CAAAATAGCCAGAAAGAAGAAGCGAGCT G 721 

Qy 812 TGATCATGATGGTGACAGTCGTGGTTCTCTTTGCTGTGTGCTGGGCACCTTTCCACATCG 871 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I 
Db 722 TGATCATGATGGTGACAGTCGTGGTTCTCTTTGCTGTGTGCTGGGCACCTTTCCACATCG 781 

Qy 872 T T CACAT GAT GAT T GAAT ACAGT AAT T T T GAAAAGGAAT AT GAT GAAGT CACAAT CAAGA 931 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I 
Db 782 T T CACAT GAT GAT T GAATACAGT AAT T TT GAAAAGGAAT AT GAT GAAGT CACAAT CAAGA 841 

Qy 932 T GAT T T T T GC T AT AGT GCAAATAAT T GGAT T T T T CAAC T C CAT CT GTAAT C C CAT TAT T T 991 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I II I I I I I I I I I 
Db 842 T GAT T T T T GCT AT AGT GCAAATAAT T G GAT T T T T CAACT C CAT CT GTAAT C C CAT T AT TT 901 

Qy 992 ATGCACTTATGAATGAAAACTTCAAAAAAAACTTTGTGTCTGCCGTTTGCTATTGCATTG 

1051 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I 
Db 902 ATGCACTTATGAATGAAAACTTCAAAAAAAACTTTGTGTCTGCCGTTTGCTATTGCATTG 961 

Qy 1052 T GAAGGAAAC AC CT T CT T CAGCAC G GAAGCAT GGAAGT T CAG GAGCT AT G GT GAT GCAC A 

1111 

I I I I I I Ml I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 962 T GAAGGAAAC AC CT T CT T CAGCAC G GAAG CAT GGAAGT T CAGGAGC T AT GGT GAT GCAC A 

1021 

Qy 1112 G GAGGGCAAAGT T AGCT GCAAGAGAGAAT C CT GT AGAGAT CAAAGGAGAAGC ATTT G G GG 

1171 

I I I I I I I I I II I I I I I I I II I I I I I I I I I I I I I I I I I II I I I I I I I II II I M II I I I I I 
Db 1022 GGAGG GCAAAGT TAG CT G CAAGAGAGAAT C C T GT AGAGAT CAAAGGAGAAG CATT T GG G G 

1081 

Qy 1172 G CAGCAACAT C GAT AT CAAGT GGT GT GAACAG C CAGAAAAGAAGAAGAGGAGAT CAAAAG 

1231 

I I I I I I I I I I II I I I II II I II I I I II I I I I I I I II I I I I II M I I I I II I I I I I II I I I 
Db 1082 GCAGCAACAT C GAT AT CAAGT G GT GT GAACAGC CAGAAAAGAAGAAGAGGAGAT CAAAAG 

1141 

Qy 1232 TGGCATCTTGTCCTCTTTAGTTCCGAATTTCTGAGAGCTCTGCTGTAGACGTGAACACTG 

1291 

I I I I I II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I II I I I I II I I I I I I I 
Db 1142 TGGCATCTTGTCCTCTTTAGTTCCGAATTTCTGAGAGCTCTGCTGTAGACGTGAACACTG 

1201 

Qy 1292 T AC CAAT GT C T T CAGAAT GAGT AT C T GT CAT ACT GTAAT C GAAAGAAAAT GAT T T T GAGA 

1351 

I II I I II I I I I I I I I I I I I I II I I I II I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I II 
Db 1202 T AC CAAT GT CT T CAGAAT GAGT AT CT GT CAT ACT GTAAT C GAAAGAAAAT GAT T T T GAGA 

1261 
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Qy 1352 AAAAGC C AGAGAG C TT T CAT AT TAAAAAT GT T GACAAACACT CAGAAGGCAG GGACAGG G 

1411 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I 
• Db 12 62 AAAAGCCAGAGAGCTTT CATATTAAAAAT GTT GACAAACACT CAGAAGGCAG GGACAGGG 

5 1321 

Qy 1412 GAT T CAAGAGTT TAAAGT CAT C CT T AG CT GCAC GATAAGT TT GAGGATAAC CT GG GCT AC 

1471 

I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
10 Db 1322 GAT T CAAGAGT T TAAAGT CAT C CT T AG CT G CAC GATAAGT TT GAG GATAAC CT G G G CT AC 

1381 

Qy 1472 AAGAGAC C CT GT CT CAAGAAG C C ATAATAAT TAAAACAAC CAT C CT T AACT AAT GATAAT 

1531 

15 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1382 AAGAGAC C CT GT CT CAAGAAGC CATAATAAT TAAAACAAC CAT C CT TAACTAATGATAAT 

1441 

Qy 1532 GACAAAGT AT T T T T C CAT T GAAAAT AC AT GT AAG C T G C AAT T T T GAAAAAT TAT T GAAC C 

20 1591 

I I I I I II I I I I I I I I I I I I I II I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1442 GACAAAGTATTTTTCCATTGAAAATACATGTAAGCTGCAATTTTGAAAAATTATTGAACC 

1501 

25 Qy 1592 AC C CT T GT GAT T AATAGAT GAAGT T T AAAAAAAT T T AAAT GT GT T T T TAT T GTAT GT AT A 

1651 

I I I I I I I M I I I I I I I I I I I I I I I I I II I I I I I I I I I I I II I I I I I I I I M I I I I I I I I I 
Db 1502 AC C CT T GT GATTAAT AGAT GAAGT T TAAAAAAAT T TAAAT GT GT T T T TAT T GTAT GTAT A 

1561 

30 

Qy 1652 TGGTTGGTT T AC C T GT GTAT AT GT CT T T CAGTAACT T GT ATAAAACT CAAT GAT CT CAGC 

1711 

I I I I I I I II I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I 
Db 1562 TGGTTGGTT T AC C T GT GTAT AT GT CT T T CAGTAACT T GT ATAAAACT CAAT GAT CT CAG C 

35 1621 

Qy 1712 T AGTAACT T TCTTCTGTGTGGT CAAT GT GAT AT GAT T T C CT AT AT AT T GCTAAAT T GAAT 

1771 

I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
40 Db 1622 T AGTAACT T TCTTCTGTGTGGT CAAT GT GAT AT GAT T T C CT AT AT AT T GCTAAAT T GAAT 

1681 

Qy 1772 G 1772 

I 

45 Db 1682 G 1682 
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RESULT 5 

US-10-272-983-38 

; Sequence 38, Application US/10272983 

; Publication No. US20030148450A1 

; GENERAL INFORMATION: 

; APPLICANT : Chen , Ruoping 

; APPLICANT: Dang, Huong T. 

; APPLICANT: Liaw, Chen W. 

; APPLICANT: Lin, I -Lin 

; TITLE OF INVENTION: Human Orphan G Protein Coupled Receptors 
; FILE REFERENCE: AREN0050 

; CURRENT APPLICATION NUMBER: US/10/272 , 983 

; CURRENT FILING DATE: 2002-10-17 

; PRIOR APPLICATION NUMBER: US/09/417 , 044 

; PRIOR FILING DATE: 1999-10-12 

; PRIOR APPLICATION NUMBER: 60/109,213 

; PRIOR FILING DATE: 1998-11-20 

; PRIOR APPLICATION NUMBER: 60/120,416 

; PRIOR FILING DATE: 1999-02-16 

; PRIOR APPLICATION NUMBER: 60/121,851 

; PRIOR FILING DATE: 1999-02-26 

; PRIOR APPLICATION NUMBER: 60/123,946 

; PRIOR FILING DATE: 1999-03-12 

; PRIOR APPLICATION NUMBER: 60/123,949 

; PRIOR FILING DATE: 1999-03-12 

; PRIOR APPLICATION NUMBER: 60/136,436 

; PRIOR FILING DATE: 1999-05-28 

; PRIOR APPLICATION NUMBER: 60/136,437 

; PRIOR FILING DATE: 1999-05-28 

; PRIOR APPLICATION NUMBER: 60/136,439 

; PRIOR FILING DATE: 1999-05-28 

; PRIOR APPLICATION NUMBER: 60/136,567 

; PRIOR FILING DATE: 1999-05-28 

; Remaining Prior Application data removed - See File Wrapper or PALM. 

; NUMBER OF SEQ ID NOS : 74 

; SOFTWARE: Patentln Ver. 2.1 

; SEQ ID NO 38 

; LENGTH: 431 

; TYPE: PRT 

; ORGANISM: Homo sapiens 
US-10-272-983-38 

Query Match 81.7%; Score 1764; DB 12; Length 431; 

Best Local Similarity 79.9%; Pred. No. 1.4e-155; 

Matches 338; Conservative 37; Mismatches 38; Indels 10; Gaps 

3; 

Qy 4 NLTAEQLSALLRLHNLTRAQFIAHYGLRPLVLTPQLPARARLALLLVGMLIFALALFGNA 63 

I : I II I III I I I I I I I I I I I I I I I Ihll I I : I I I : I I : I I I I I I I I I I I 

Db 5 NITPEQFSRLLRDHNLTREQFIALYRLRPLVYTPELPGRAKLALVLTGVLIFALALFGNA 64 

Qy 64 LWYVVTRSKAMRTWNIFICSLALSDLLIVFFCIP 123 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I 
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Db 


65 


LVFYVVTRSKAMRTWNIFICS]J\LSDLLITFFCIPWMLQNISDNWLGGAFICKWPFV 


124 




Qy 


1 O A 

12 4 


QCTAlVTEILTMTCIAVEKnQGLVnPr KMKKy Y 1 JNyKAr INijCjVVWJjVAllltjbFJyiWrlvy 


loo 








i i i • i i i • i i i i i i i i i 1 t i 1 1 1 1 1 i 1 1 1 1 1 1 1 • 1 1 1 1 1 1 1 1 1 1 1 1 1 • 1 • 1 1 1 1 1 1 1 1 
1 1 1 : 1 1 1 : 1 1 II ! 1 1 1 1 M 1 1 1 1 1 M 1 1 1 I 1 1 1 • 1 1 1 1 1 1 1 1 1 1 1 1 1 • I * 1 1 1 1 M 1 1 




5 


Db 


125 


QSTAVVTEMLTMTCIAVERHQGLVHPFKMKWQYTNRRAFTMLGVWLVAVIVGSPmHVQ 


184 




Qy 


T O A 

184 


RLEIKYDr LYEKEnlCCLEEWSSFVnQKl Yl IrlLVlLr LLr ItLblib Vlii tarvl lax bliWl r\ 


O A "5 
£ Q O 








• i i i i i i i i i i i 1 i i i 1 1 1 1 1 * 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 • • • • 1 I 1 1 1 1 1 1 1 1 I 
: 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 • 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 • • • -II II 1 1 1 1 1 II 




10 


Db 


185 


QLEI KYDFLYEKEHI CCLEEWTS PVHQKI YTTFILVILFLLPLMVMLI LYSKI GYELWI K 


244 




Qy 


244 


KRI GDGSVLRT I HGKEMFKIARKKKRAVIMMVTVVVLF 








1 1 . 1 1 1 t 1 1 1 1 1 1 1 1 t 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 • 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 1 : 1 1 1 II 1 1 I II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 : 1 1 1 1 1 1 1 1 1 1 1 1 1 






Db 


245 


KRVGDGSVLRTIHGKEMSKIARKKKRAVIMMVTWALFAVCWAPFHVVHMMIEYSNFEKE 


304 


15 


Qy 


304 


YDEVTIKMIFAIVQIIGFFNSICNPIIYALMNENFKKNFVSAVCYCIVKETPSSARKHGS 


363 








1 1 : 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 : 1 1 1 1 1 1 1 1 1 1 : 1 1 1 1 1 1 1 1 : 1 1 1 : : 1 1 : 






Db 


305 


YDDVTIKMI FAIVQIIGFSNSICNPIVYAFMNENFKKNVLSAVCYC 






Qy 


364 


SGAMVMHRRAKLAARENPV-EIKGEAFGGSNIDIKWCEQPEKKKR RSKVA-S 


413 


20 




II : 1 : : 1 1 : 1 1 1 1 1 1 1 1 1 1 1 1 1 : : 1 1 1 1 1 : 1 1 : 1 1 : : 1 : 






Db 


365 


SGITMMRKKAKFSLRENPVEETKGEAFSDGNIEVKLCEQTEEKKKLKRHLALFRSELAEN 


424 




Qy 


414 


CPL 416 




25 


Db 


425 


1 1 

SPL 427 





